Cross-Media Retrieval via Semantic Entity Projection
Authors
Abstract
Cross-media retrieval is becoming increasingly important. To address this challenging problem, most existing approaches project heterogeneous features into a unified feature space so that similarities can be computed across media types. However, this unified feature space usually has no explicit semantic meaning, so it may ignore the hints contained in the original media content and cannot fully measure the similarities among different media types. To address these issues, we propose a new approach to cross-media retrieval via semantic entity projection (SEP). Our approach consists of three main steps. First, an entity level with fine-grained semantics is constructed between low-level features and high-level concepts, which helps bridge the semantic gap to a certain extent. Then, an entity projection is learned by minimizing both the cross-media correlation error and the single-media reconstruction error from low-level features to the entity level; with this projection, a unified feature space with explicit semantic meaning can be obtained from low-level features. Finally, the semantic abstraction of high-level concepts is generated by using logistic regression to conduct cross-media retrieval. Experimental results on the Wikipedia dataset show the effectiveness of the proposed approach.
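The projection step described in the abstract can be illustrated with a small sketch. This is not the authors' formulation: the abstract gives no equations, so the objective below is an assumed stand-in in which the single-media reconstruction error is taken as the squared distance between projected features and a given entity-level matrix `E`, and the cross-media correlation error as the squared distance between the projections of paired image and text samples. The names `fit_projections`, `objective`, `rank_by_cosine`, `lam`, and `mu` are hypothetical, the entity-level matrix is assumed to be already available, and the logistic regression step is omitted.

```python
import numpy as np

def objective(X_img, X_txt, E, P_img, P_txt, lam, mu):
    """Assumed loss: reconstruction to E per medium + lam * cross-media gap + ridge."""
    Z_img, Z_txt = X_img @ P_img, X_txt @ P_txt
    recon = np.sum((Z_img - E) ** 2) + np.sum((Z_txt - E) ** 2)
    cross = np.sum((Z_img - Z_txt) ** 2)
    ridge = np.sum(P_img ** 2) + np.sum(P_txt ** 2)
    return recon + lam * cross + mu * ridge

def fit_projections(X_img, X_txt, E, lam=1.0, mu=1e-2, n_iter=20):
    """Block coordinate descent: each update is an exact ridge-regression solve."""
    d_img, d_txt, k = X_img.shape[1], X_txt.shape[1], E.shape[1]
    P_img = np.zeros((d_img, k))
    P_txt = np.zeros((d_txt, k))
    # Left-hand sides of the normal equations stay fixed across iterations.
    A_img = (1.0 + lam) * X_img.T @ X_img + mu * np.eye(d_img)
    A_txt = (1.0 + lam) * X_txt.T @ X_txt + mu * np.eye(d_txt)
    for _ in range(n_iter):
        # Minimize over P_img with P_txt held fixed, then the reverse.
        P_img = np.linalg.solve(A_img, X_img.T @ (E + lam * (X_txt @ P_txt)))
        P_txt = np.linalg.solve(A_txt, X_txt.T @ (E + lam * (X_img @ P_img)))
    return P_img, P_txt

def rank_by_cosine(query_vec, gallery):
    """Return gallery indices sorted from most to least similar to the query."""
    q = query_vec / (np.linalg.norm(query_vec) + 1e-12)
    G = gallery / (np.linalg.norm(gallery, axis=1, keepdims=True) + 1e-12)
    return np.argsort(-(G @ q))

# Synthetic paired data (illustrative only, not the Wikipedia dataset).
rng = np.random.default_rng(0)
n, k, d_img, d_txt = 200, 5, 12, 8
E = rng.random((n, k))  # assumed entity-level representation
X_img = E @ rng.normal(size=(k, d_img)) + 0.01 * rng.normal(size=(n, d_img))
X_txt = E @ rng.normal(size=(k, d_txt)) + 0.01 * rng.normal(size=(n, d_txt))

P_img, P_txt = fit_projections(X_img, X_txt, E)
# Image-to-text retrieval: project both media, then rank texts for image 0.
ranking = rank_by_cosine(X_img[0] @ P_img, X_txt @ P_txt)
print("top-5 text indices for image query 0:", ranking[:5])
```

Because both media are mapped onto the same entity dimensions, each coordinate of the shared space keeps an interpretable meaning, which is the property the abstract contrasts with unified spaces that lack explicit semantics.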
Similar resources
Cross Media Entity and Concept Driven Search
In recent years there has been rapid growth of unstructured text and multimedia content, including audio, video and image content, on the web and within the enterprise. Handling these large volumes of data is cumbersome without effective methods for content analysis and retrieval. This paper presents Sensefy, a cross-media information retrieval system which uses higher-level semantic concepts...
Cross-media Retrieval Based on CSRN Clustering
A novel cross-media retrieval methodology is proposed to help users describe their requirements more accurately and naturally and obtain responses. The first problem to be solved in this paper is how to build a bridge among heterogeneous information. An efficient approach is adopted to mine the semantic relationships among different multimedia data. We propose the scheme, called cross-media semanti...
Cross-Language Image Retrieval via Spoken Query
This paper studies cross-language cross-medium information retrieval. We introduce several approaches to unify the languages and media of queries and documents. We experiment on cross-language image retrieval via spoken query. Two approaches are proposed to recognize and translate spoken queries. We also propose a similarity-based approach to identify and backward transliterate named entities i...
Mistral – Measurable, Intelligent and Reliable Semantic Extraction and Retrieval of Multimedia Data
Multimedia data has a rich and complex structure in terms of inter- and intra-document references. Its potential is severely limited unless effective methods for semantic extraction and semantic-based cross-media exploration and retrieval can be devised. Today’s leading-edge techniques in this area work well for low-level feature extraction and, to a certain degree, for concept recognition...
Semantic-Based Cross-Media Image Retrieval
In this paper, we propose a novel method for cross-media semantic-based information retrieval, which combines classical text-based and content-based image retrieval techniques. This semantic-based approach aims at determining the strong relationships between keywords (in the caption) and types of visual features associated with its typical images. These relationships are then used to retrieve im...
Publication date: 2016